Cognitive Speech Coding

نویسندگان

  • Milos Cernak
  • Afsaneh Asaei
  • Alexandre Hyafil
چکیده

Speech coding is a field where compression paradigms have not changed in the last 30 years. The speech signals are most commonly encoded with compression methods that have roots in Linear Predictive theory dating back to the early 1940s. This paper tries to bridge this influential theory with recent cognitive studies applicable in speech communication engineering. This tutorial article reviews the mechanisms of speech perception that lead to perceptual speech coding. Then it focuses on human speech communication and machine learning, and application of cognitive speech processing in speech compression that presents a paradigm shift from perceptual (auditory) speech processing towards cognitive (auditory plus cortical) speech processing. The objective of this tutorial is to provide an overview of the impact of cognitive speech processing on speech compression and discuss challenges faced in this interdisciplinary speech processing field. In this context, it covers the traditional speech coding techniques as well as emerging approaches facilitated by deep learning computational methods. The tutorial points out key references on fundamental teachings of psycholinguistics and speech neuroscience and provides a valuable background to beginners and practitioners on the promising directions of incorporating principles of cognitive speech processing in speech compression.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical parametric speech synthesis with a novel codebook-based excitation model

Speech synthesis is an important modality in Cognitive Infocommunications, which is the intersection of informatics and cognitive sciences. Statistical parametric methods have gained importance in speech synthesis recently. The speech signal is decomposed to parameters and later restored from them. The decomposition is implemented by speech coders. We apply a novel codebook-based speech coding ...

متن کامل

The Role of L2 Private Speech in Cognitive Regulation of Adult Foreign Language Learners

The present study investigated the use of L2 private speech by English foreign language (EFL) learners in regulating their mental activities. Thirty intermediate adult EFL learners took a test of solving challenging English riddles while their voices were being recorded. Following, instances of the produced private speech were analyzed in terms of form, content, and function. Numerous instances...

متن کامل

Effects of sound pillow in the treatment of stuttering and cognitive phonemes impairment in children

Introduction:Verbal language is Fundamental component for expressing ideas, social interaction and understanding educational materials. Effective communications require verbal language skills. Sound pillows may partly address the children with behavior problems. The purpose of this study was assessing the effect of educational sound pillow in the treatment of stuttering and cognitive phonemes i...

متن کامل

Achievable Secrecy Rate Regions of State Dependent Causal Cognitive Interference Channel

In this paper, the secrecy problem in the state dependent causal cognitive interference channel is studied. The channel state is non-causally known at the cognitive encoder. The message of the cognitive encoder must be kept secret from the primary receiver. We use a coding scheme which is a combination of compress-and-forward strategy with Marton coding, Gel’fand-Pinsker coding and Wyner’s wire...

متن کامل

1 1 2 3 A mutual information analysis of neural coding of speech by low 4 frequency MEG phase information 5

3 A mutual information analysis of neural coding of speech by low 4 frequency MEG phase information 5 Gregory B. Cogan & David Poeppel 6 7 1 Neuroscience and Cognitive Science, University of Maryland College Park 8 2 Department of Psychology, NYU 9 3 Center for Neural Science, NYU 10 11 Running Head: Mutual Information and MEG Phase 12 13 Address for Correspondence 14 Gregory B. Cogan 15 Depart...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016